Systems and methods for enhanced video books

ABSTRACT

An enhanced video book and a system and method for creating an enhanced video book are described. Artwork and text corresponding to a storyline can be converted into a format that can be animated. A timing is established at which the converted artwork can be displayed, at a pace corresponding to the timing at which the converted text can be read. The converted artwork and/or the converted text are animating, and voice-over narration corresponding to the converted text is generated. The display of the converted artwork or the converted text is adjusted and synchronized with the voice-over narration based on the timing at which the converted artwork can be displayed. Audio is added and synchronized to the converted artwork. The converted artwork, the converted text, the animated or converted artwork, the animated converted text, the voice-over narration, and the audio are combined into an enhanced video book.

CROSS REFERENCE TO RELATED APPLICATION

This application claims the benefit of priority to U.S. ProvisionalApplication 62/938,891, “Enhanced Video Books,” filed Nov. 21, 2019. Theentire contents of U.S. Provisional Application 62/938,891, “EnhancedVideo Books,” are hereby incorporated into this document by reference.

COPYRIGHT STATEMENT

A portion of the disclosure of this patent document contains materialthat is subject to copyright protection. The copyright owner has noobjection to the facsimile reproduction by any individual or collectiveof the patent document or the patent disclosure as it appears in thePatent and Trademark Office patent file or records, but otherwisereserves all copyright rights whatsoever.

TECHNICAL FIELD

This disclosure relates to video books, for example, enhanced videobooks for children.

BACKGROUND OF THE DISCLOSURE

Books are a form of visual media and communication containing text.Books can contain graphics corresponding to the text. People of all agescan read books. Some books, for example, a children's story book, areformed with text and images that can make reading a fun and enjoyableexperience. Some books can be formed on paper with ink. In paper and inkbooks, the text and graphics are static on the pages of the book. Areader, such as a child reading a story book, would merely read the textand look at the graphics in the book. The child reader may read the bookwith help from a parent or other caregiver by visually scanning the textand graphics on a page and then manually turning to the next page toview the text and graphics therein. The reader can visually scan thetext and graphics on a page, initially learning this style by using thereader's finger to follow along with the text and graphics. Another formof books can be electronic books (e-books). E-books can replicate thereading experience that a reader has with paper and ink books. In somecases, a child who reads either a paper and ink book or an e-book canlose interest, become distracted, or otherwise not finish reading thebook.

SUMMARY

This disclosure relates to systems and methods for an enhanced videobook, a protocol for animated read along text, and enhanced closedcaptioning commands. Implementations of the present disclosure include amethod for creating an enhanced video book. The method can includeextracting artwork and text from a physical book and/or a digital book.Extracting at least one of artwork and text from a book can includescanning pages of the book into a user device. In some implementations,the artwork and text can be created where there is no physical book asinput. The method includes converting the artwork and text into a formatthat can be animated or set into motion. Converting the extracted artand text can include breaking up the extracted artwork and text into oneor more layers. The one or more layers can each be animated or set intomotion.

The method includes establishing a timing at which the converted artworkcan be displayed and establishing a pace at which the converted text canbe read. The pace corresponds to the timing. Establishing the timing canbe based on at least one of a length of the physical book, a quantity ofextract artwork, and a quantity of extracted text. Establishing the pacecan be based on at least one of an age group of readers of the physicalbook, a reader skill level, and a quantity of extracted text.

The method includes animating or setting into motion at least oneportion of the converted artwork and animating at least one portion ofthe converted text. At least one animated or set into motion portion ofthe converted artwork can be a character or an object.

The method includes generating voice-over narration, where thevoice-over narration corresponds to the converted text. The methodincludes adjusting display of the at least one animated or set intomotion portion of the converted artwork based on the timing at which theconverted artwork can be displayed and adjusting display of the at leastone animated portion of the converted text based on the pace at whichthe converted text can be read. The method includes synchronizing the atleast one animated portion of the converted text with the voice-overnarration.

The method includes adding audio to the converted artwork andsynchronizing the audio with at least one of the at least one animatedor set into motion portion of the converted artwork, the at least oneanimated portion of the converted text, and the voice-over narration.Adding audio to the converted artwork can include creating audio thatcorresponds to at least one of a storyline of the physical book, theconverted artwork, and the converted text. The audio can be at least oneof music and sound effects.

The method includes combining the converted artwork, the converted text,the at least one animated or set into motion portion of the convertedartwork, the at least one animated portion of the converted text, thevoice-over narration, and the audio into an enhanced video book.

In some implementations, the method can further include delivering,using a defined delivery method, the enhanced video book to a userdevice for playback. Delivering the enhanced video book can includerendering the enhanced video book into a linear video file and exportingthe linear video file based on the defined delivery method. The linearvideo file has a frame rate based on one or more specifications thatcorrespond to the defined delivery method. The user device receives therendered linear video file. The user device can have linear videoplayback capability. The user device can be a mobile phone, a tablet, alaptop, an e-reader, a computer, a projector, an augmented realitydevice, a TV, or any other playback device. Before delivering theenhanced video book to the user device for playback, the enhanced videobook can be processed through a content delivery network. The defineddelivery method can be a streaming or broadcast delivery method.

In some implementations, delivering, using the defined delivery method,the enhanced video book to a user device for playback can furtherinclude storing the rendered linear video file in a database. Then, therendered linear video file can be retrieved from the database based onreceiving a playback request from a second user device. The renderedlinear video file can be delivered to the second user device forplayback.

The method for creating an enhanced video book can further includedetermining one or more pauses in at least one portion of the enhancedvideo book and generating one or more prompts that correspond to the oneor more pauses and a storyline of the enhanced video book.

Further implementations of the present disclosure include a system forcreating an enhanced video book. The system includes a first device anda second device.

The first device has multiple editing applications. The editingapplications provide multiple user interfaces for display at the firstdevice. A user provides a user input to the user interfaces of theediting applications. The first device, based on the user input,extracts artwork and text from a physical book. In otherimplementations, the first device, based on the user input, can generateartwork and text for a storyline. In other words, the user input can bea storyline instead of the physical book. The first device, based onuser input, then converts the artwork and text into a format that can beanimated or set into motion, establishes a timing at which the convertedartwork can be displayed, and establishes a pace at which the convertedtext can be read, wherein the pace corresponds to the timing. Next, thefirst device, based on user input, animates or set into motion at leastone portion of the converted artwork; animates at least one portion ofthe converted text, and generate voice-over narration. The voice-overnarration corresponds to the converted text.

The first device, based on user input, adjusts display of the at leastone animated or set into motion portion of the converted artwork basedon the timing at which the converted artwork can be displayed. Next, thefirst device, based on user input, adjusts display of the at least oneanimated portion of the converted text based on the pace at which theconverted text can be read. Subsequently, the first device, based onuser input, synchronizes the at least one animated portion of theconverted text with the voice-over narration.

Then, the first device, based on user input, adds audio to the convertedartwork. Following adding audio to the converted artwork, the firstdevice, based on user input, synchronizes the audio with at least one ofthe at least one animated or set into motion portion of the convertedartwork, the at least one animated portion of the converted text, andthe voice-over narration.

The first device, based on user input, combines the converted artwork,the converted text, the at least one animated or set into motion portionof the converted artwork, the at least one animated portion of theconverted text, the voice-over narration, and the audio into an enhancedvideo book.

In some implementations, the first device, based on user input, canrender the enhanced video book into a linear video file and export thelinear video file for playback at the second device. The linear videofile can have a frame rate based on one or more specifications thatcorrespond to a defined delivery method of the second device. The linearvideo file can be based on the defined delivery method. In someimplementations, first user device can further store the rendered linearvideo file in a database. The first user device, can then retrieve therendered linear video file from the database based on receiving aplayback request from a third user device. Next, the first user devicecan deliver the rendered linear video file to the second user device forplayback.

The second device has a graphical user interface display. The seconddevice receive, from the first device, the enhanced video book and playsthe enhanced video book on the graphical user interface display.

Further implementations of the present disclosure can include anenhanced video book. The enhanced video book includes animated textdisplayed in the enhanced video book at one or more first timecodes. Theenhanced video book includes animated artwork displayed in the enhancedvideo book at one or more second timecodes. The enhanced video bookincludes voice-over narration corresponding to the animated text. Thevoice-over narration plays at the one or more first timecodes. The oneor more first timecodes and the one or more second timecodes aresynchronized at a pace at which the animated text can be read and theanimated artwork can be viewed. In some implementations, the enhancedvideo can include at least one of music or sound effects that aresynchronized with the one or more first timecodes and the one or moresecond timecodes.

Implementations of the present disclosure can have one or more of thefollowing advantages. For example, readers can experience a book (e.g.,material) in new and appealing ways that increase a number of sensoryfeeds. By stimulating different senses, reader interest in the book canincrease. Increasing the number of sensory feeds by animating the bookcan capture and maintain a child's attention, which can be essential tothe child's development and education. Game-ification of the book'sstoryline can be decreased, which can therefore increase engagement withthe book. This is because fewer distractions may be presented to thereader. In other words, the disclosed systems and methods provide forengaging the reader with the book's storyline without distracting thereader by presenting the reader with interactive, or game-type,elements. The enhanced video books described herein can provide a linerexperience to the reader, which is less distracting than e-books butmore captivating than paper and ink books. For example, e-books maytypically provide a variety of ways to change delivery of a story. Thiscan frustrate the intent of finishing a book from start to end,especially for younger children who may not allow an e-book to finish.As a result, child readers may not get the intended benefits of readingstorybooks in formats such as e-books.

As another example, reader comprehension of a storyline can be improved.Reader tunnel vision, such as zoning out, can be decreased. Enhancedvideo books as disclosed herein can assist younger children in finishingbooks, improving the reading experience, and improving a learningexperience intended the author. Moreover, the disclosed systems andmethods can convey an entire paper book experience without compromisingor removing parts from the book's storyline. Other media formats, suchas videos, short films, or e-books may typically adapt the book'sstoryline. As a result, these formats may not fully integrate text fromthe paper book, which diminishes the reader's ability to take away theintended reading experience.

Moreover, reader comprehension can be improved by implementing protocolfor animated read-along text (e.g., PART), as described throughout thisdisclosure. Presenting linear text can improve reading comprehension forreaders who have difficulty hearing. In some cases, these readers canprefer watching content of a book without audio so as not to disruptother people who are nearby. The disclosed systems and methods canprovide these readers with such functionality. Therefore, these readerscan get the full reading experience.

PART, as described herein, can also assist readers to speed up andimprove a process for learning how to read. Using linear text foreducational purposes can help increase reading skills, such as readingfaster. Hearing a word pronounced and seeing it being pronounced at thesame time can help young or new readers to better understandpronunciation, meaning of words, different vocabulary, and grammar. Thisfeature can also assist any type of reader in learning new languages. Byanimating test using PART, storylines can be more attractive to childreaders, thereby securing their attention for a greater period of timeand improving the overall learning experience.

Contextual reasoning can also be improved using PART and other systemsand methods disclosed herein. This can especially be improved forreaders who prefer adding a text feature to video books. The ability tosee pictures, especially those having additional features such asanimations and popups, can help the reader associate the text that isbeing read with visuals. The reader can develop a greater understandingof the storyline as well as general vocabulary.

The disclosed systems and methods can also make following along in avideo book easier. Youth are taught to follow along in books via afinger, typically the index finger. This traditional method helps thereader keep track of where they are on the page of the book. However, inelectronic and video books, using the index finger can become harder todo. Touchscreen devices can be too sensitive such that the reader mayinadvertently zoom, turn a page, or accidentally click on anadvertisement. This can be problematic for a young reader who islearning to read, a reader who is trying to strengthen reading skills,or a person who is viewing the book for pleasure or entertainment. Usingthe disclosed systems and methods can assist the reader in followingalong without distractions such as having to turn a page in a book oraccidentally pressing something on a sensitive touchscreen interface.The disclosed systems and methods can animate text in real-time, word byword and/or letter by letter, which can increase a reader's awareness ofwhere they are in relation to the audio or narration of the book.Therefore, the reader can follow along without having to use theirfinger or accidentally pressing something on a touchscreen.

Ways to experience a book's material and subject matter can beincreased. The systems and methods described herein can provide forcreating a fun and animated version of words that can be read, spoken,or listened to. As a result, the reader's focus and ability to payattention to the storyline can be increased, thereby increasing learningability and brain development of the reader. Early childhood developmentis essential for a person's long term cognitive and reasoning abilities,along with emotional intelligence. Keeping a child's attention canincrease the growth that the child will encounter. Children enjoy funactivities more, so by increasing the fun of reading using the disclosedsystems and methods, development of the child can also be increased.Reading is a cornerstone to proper early childhood development. Theworld can open up more to a child who is proficient in reading. Parentsand guardians can then allow the child to seek content or learningmaterial that they choose, which typically they find intriguing or fun.Using the disclosed systems and methods, books can become moreintriguing and fun for child readers, which can stimulate the childreaders to read more. The reading experience can be more fun because thedisclosed systems and methods can implement visual sensors that projecttimely and animated read-along text with animated imagery, soundeffects, music, and/or narration. A combination of such features canassist the reader to pick their own pace to read the book withoutassistance from other people or other media formats. Moreover, areader's position on a page can be determined using sensing capabilitiesto then adjust a timing, speed, or pace for presenting the reader withanimated read-along text. As a result, the reader's attention can bemaintained, the learning experience can be improved, and the reader canfind the reading experience fun and intriguing.

Additionally, readers having a reduced ability to see clearly orneurological disabilities can more easily follow animated read-alongtext. For example, if some brain paths are underdeveloped, some sensorypathways can be hindered or functionally reduced. In some of thesecases, animated text can improve the brain's ability to track along withthe material. Therefore, the disclosed systems and methods can assistsuch readers in being able to read while also improving or developingsensory abilities of their brains. Finally, for readers who have visonclarity issues, the disclosed systems and methods provide for usingdifferent colors, animation, strokes, bolding, highlighting, and otherfeatures to assist the reader in visualizing, identifying, and followingtext as it is being narrated.

Details of one or more implementations of the subject matter describedin this specification are set forth in the accompanying drawings and thedescription below. Other features, aspects, and potential advantages ofthe subject matter will become apparent from the description, thedrawings, and the claims.

BRIEF DESCRIPTION OF THE DRAWINGS

FIGS. 1A-1C illustrate an implementation of a manner in which anenhanced video book can be presented to a user.

FIG. 2 illustrates an example of a process for generating and displayingthe enhanced video book of FIGS. 1A-1C.

FIGS. 3A-3E illustrate an example of a protocol for animated read-alongtext for the enhanced video book of FIGS. 1A-1C.

FIG. 4 illustrates an example system and process for generating,delivering, and displaying enhanced closed captioning.

FIG. 5 illustrates example system components for performing one or moreof the processes described herein.

FIGS. 6A-6B is a flowchart of an example process for creating anenhanced video book.

FIGS. 7A-7B is a flowchart of an example process for animatingread-along text.

FIG. 8 is a flowchart of an example process for generating enhancedclosed captioning commands.

FIG. 9 illustrates another example of a protocol for animated read-alongtext for the enhanced video book of FIGS. 1A-1C.

Like reference numbers and designations in the various drawings indicatelike elements.

DETAILED DESCRIPTION

This disclosure relates to enhanced video books, for example, booksdelivered electronically for display on a user device such as a tabletcomputer, smartphone, laptop computer, or the like. Enhanced video bookscan animate each page in real-time in a seamless manner such thatgraphics, audio, and text may be generated in front of a user's eyes asthe user views the book. The text can be emphasized as the user viewsthe book to assist the user in following along as the text is beingnarrated. For example, a line of text having multiple words can bedisplayed and successively emphasized at a pace of human speech. As anexample, each word in the line of text can be displayed in a firststate. Then each word can be emphasized and displayed in a second stateas having an outline. The emphasized word can then be displayed in athird state as heavier-weighted text. The emphasized word can also bedisplayed in a fourth state as regular text. The words can be visuallyemphasized one at a time such that a first word is displayed in thethird state when a second word is displayed in the second state and thefirst word is displayed from the third state to the fourth state whenthe second word is displayed from the second state to the third stateand a third word is displayed in the second state. As another example,the text can be displayed as enhanced closed captioning in a video. Thevideo can have multiple frames and a user can define an appearance and alocation of text to be displayed in each of the video frames. Theappearance and the location of text can be synchronized with each of theframes of the video. A delivery packet can then be generated thatincludes a design packet, the video, a video timecode, a text timecode,and enhanced closed captioning commands. The delivery packet can beprovided to a user device for seamless playback.

Referring to the figures, FIGS. 1A-1C illustrate an implementation of amanner in which an enhanced video book can be presented to a user. FIGS.1A-1C depict three successive scenes to illustrate gradual animationsthat are seen by the user to provide a smooth and seamless readingexperience. FIG. 1A is a scene 100 of a princess character 101 lightlymoving via animation. The scene 100 also includes a bird 102 shownflapping its wings as it moves slightly up and down via animation.Semi-transparent text 103 appears on the screen 100. Thesemi-transparent text 103 corresponds to words that can be narrated by avoiceover narration.

FIG. 1B shows an updated scene 100′ in which a prince character 104 isnow added to the scene 101′, also lightly moving via animation. The bird102 has moved and continues flapping its wings. The princess 101, theprince 104, and the bird 102 represent original artwork from a storybookthat are now set in motion with animation. As described throughout thisdisclosure, a developer can set the storybook into motion using one ormore known software tools or applications. The developer can lightlyanimate the images 101, 104, and 102 to stimulate the reader and keepthe reader's attention without distracting the reader from continuing tofollow along with a storyline.

At this point in the scene 100′, the narrated voiceover has begun, andhas narrated the words “The Fairy” 105. The voiceover is just startingto narrate the word “Godmother's” 106. The narrated words (“The Fairy”105) have transitioned from semi-transparent text, as insemi-transparent text 103 (shown in FIG. 1A), to now fully opaque textas shown in updated scene 100′ “The Fairy” 105. The word in the processof being narrated, in this instance “Godmother's” 106, demonstrates theprocess of extra emphasis being added to the semi-transparent text 103,synchronized with the voiceover to emphasize each word as it is narratedor spoken. The voiceover narration and animated text can be paced tomatch a range of beats per minute as defined by elements, for example,such as the original storybook's characteristics, a targeted agedemographic, and thematic elements. The developer can determine how toemphasize each word and synchronize such animations with the voiceovernarration. For example, as described further below, the developer canchoose to emphasize the words “The Fairy” 105 as semi-transparent textonce the voiceover completes reading those words. This type of emphasiscan cause the reader to follow the other words in the line of text asthey are emphasized, thereby reducing distractions in the scene 100′ andassisting the reader to continue reading and following along with thestoryline.

Lastly, FIG. 1C shows a scene 100″ in which the princess 101, the prince104, and the bird 102 continue to move slightly with animation. In thisexample, the voiceover narration has continued and progressed throughmore of the semi-transparent text 103 (refer to FIG. 1A). At this point,the voiceover has narrated the text “The Fairy Godmother's magic soonfaded. And Cinderella again had nothing to wear. But the” 107. The word“prince” 108 is currently in a process of being narrated, and thereforeis being emphasized in another state of emphasis that is synchronizedwith the voiceover narration. In FIG. 1C, previously narrated textremains opaque after it is narrated (e.g., the text 107). In otherimplementations, the developer can choose one or more different statesof emphasis to apply to each word, each letter, each phrase, and/or eachsyllable in the line of text. As a result, the developer can create anenhanced video book from an original storybook having unique,customized, and dynamic integration of animations, audio, sound effects,and other visualizations. The enhanced video book can assist the readerin maintaining their focus on reading and improving an overall learningexperience.

FIG. 2 illustrates an example of a process 200 for generating anddisplaying an enhanced video book 220 of FIGS. 1A-1C. The enhanced videobook 220 (e.g., EVB) can be a compilation of one or more different mediaformats and features. For example, the EVB 220 can be a digital,non-interactive, linear reproduction of a book 202. The book 202 can bea typical bound paper and ink book. In other implementations, the book202 can be a typical electronic book (e.g., e-book). Text 206 andartwork 204 (and/or illustrations) of the book 202 can be extrapolatedusing known techniques and/or software application or tools. In someimplementations, the book 202 (e.g., physical book) may not be used inthe process 200. Instead, a user, such as an author or writer, canprovide a storyline to a developer. The developer can then generate text206 and artwork 204 for that storyline using the process 200. In otherwords, the developer can build original characters, images, scenes, andtext around the storyline provided by the user, even if the user doesnot have the storyline in a physical book format (e.g., the book 202).In yet other implementations where the physical book 202 is not used inthe process 200, the user can provide the developer with animated videoas input for the EVB 220. The developer can then build a storyline, text206, and/or narration for the received animated video using the process200 and one or more other techniques described herein.

The text 206 and/or artwork 204 can be animated and/or set into motionby a developer in creating the EVB 220. A developer can add additionalmedia and features to the text 206 and artwork 204 to further enhance oranimate the EVB 220. For example, music 208, sound effects 210 (e.g.,SFX), animation(s) 213 (e.g., the text 206 and/or the artwork 204 can beanimated), and different language 212 can be added to the EVB 220. Thesefeatures can be added to one or more parts of the EVB 220 in acomposition process 214. The composition process 214 can be done by thedeveloper using known techniques and/or software application or tools.For example, the developer can use a suite or package of animationsoftware to bring together one or more features of the EVB 220.

Once the EVB 220 is generated in the compilation process 214, the EVB220 can be packaged for delivery via a content delivery network 216(e.g., CDN). The EVB 220 can be delivered to a device screen 218 forplayback. The device screen 218 can be part of any suitable digitalmeans, such as a mobile phone, smartphone, mobile application, laptop,computer, e-reader, digital broadcast, etc.

Enhanced video books 220 are digital, page-for-page, linear versions orreproductions of books with minimal interactivity, and are designed tobe streamed like videos or games. The EVB 220 can be delivered in videoformat. The EVB 220 provides the reader with limited functionality ascompared to an e-book. The reader of an EVB 220 can be limited to stop,play, rewind, skip, and select language (voice and copy) functions.Other than these functions, the enhanced video book 220 can be free ofmanual interactivity by the reader. As a result, the reader can beengaged in following a storyline of the EVB 220 without being distractedby too many interactive elements. The EVB 220 can be streamed and/ordownloaded via the Internet or mobile applications, thereby making iteasier for the reader to take the EVB 220 with them and read at any timethat the reader desires. The pacing of the EVB 220 can be intentionallyslower to mimic a parent reading to a child. This can be advantageous toassist the reader in learning how to read, learning how to pronouncewords, and learning vocabulary.

The process 200 of conforming the original artwork 204 and the text 206of the book 202 to create the EVB 220 can involve use of technicalmethods of animating and rendering one or more features of the EVB 220,as described herein. Input to the process 200 can be the conventionalbook 202, for example, a children's storybook or picture book havingtext and graphics (e.g., still images). From the book 202, the artwork204 and text 206 can be extracted and converted to a digital format. Aviewer of the EVB 220 can look back at the original book 202, from whichthe enhanced video book 220 is created, and see a direct parallelbetween both. In other words, all the artwork 204 and the text 206 fromthe book 202 are incorporated into the EVB 220. The developer does notadapt or remove content from the book 202, thereby ensuring that astoryline of the EVB 220 remains true to a storyline of the book 202. Asa result, the viewer of the EVB 220 can get a full reading experiencethat an author of the book 202 intended. The composition process 214 ofconforming the original artwork 204 to video specifications andstandards can include use of one or more technical methods of animatingand rendering out the EVB 220 for use in streaming or other linear videodelivery methods and platforms.

Still referring to FIG. 2 , the artwork 204 can be original artwork fromthe book 202. The original artwork 204 can include text and/or imagesthat make up the book 202. The artwork 204 may be altered or amended asneeded to account for 1) changes in formatting that may arise fromformatting for video versus original formatting of the artwork 204, 2)changes resulting from animating the artwork 204, or setting it inmotion, or changes necessitated by animation or motion that requirealtering or amending the original artwork 204 to accommodate for suchanimation or motion, and/or 3) changes resulting from a process ofconverting the original artwork 204 into a form to be animated or set inmotion. For example, the original artwork 204 can be broken andflattened into layered pieces of the artwork 204. As a result, each ofthe layered pieces can be more easily and individually animated or setin motion by the developer. Any such changes can be made in a similarstyle or character of the original artwork 204 such that the EVB 220parallels the original book 202.

As described above, in some implementations, the artwork 204 may not befrom a physical book, such as the book 202. Instead, the artwork 204 canbe created by the developer based on a storyline received from the user(e.g., author or writer). The artwork 204 can also be a video oranimated images, rather than artwork from the physical book.

The developer of the enhanced video book 220 can generate additionaldigital content such as the music 208, sound effects 210, animations213, and/or additional graphics. The developer of the enhanced videobook 220 can also specify one or more languages 212 in which the text204 of the enhanced video book 220 can be displayed and/or narrated.

As mentioned, the book 202's artwork 204 can contain text 206. The text206 can be animated in a read-along fashion so as to mimic a process ofreading. As an example, animating the text 206 can be accomplished byhighlighting each word, one at a time, in synchronization with voiceovernarration of the same text 206. The animated read-along text can followone or more different formats of the developer's choosing. In someimplementations, the animated read-along text can follow a format bywhich 1) semi-transparent text appears on screen and 2) voiceover beginsnarrating the same semi-transparent text while 3) the semi-transparenttext is transformed word by word, in sync with the voiceover narration.As a result, the same word being spoken by the voiceover can beemphasized, thereby transitioning to a fully opaque state. This type ofanimated read-along text can be beneficial to assist the reader inlearning how to read, following along, and maintaining focus or interestin a reading experience.

As mentioned, voiceover narration of the text 206 can also be includedin a read-along fashion. The voiceover narration can be synchronizedwith animated read-along text. The voiceover narration can match inpacing, theme, and tonality of the original book 202, and can also beadjusted for the book's age demographic. Moreover, one or more voiceovernarrations can be provided in different languages 212, such that thereader can learn a different language or read the book 202 in a languageof the reader's preference (e.g., the book 202 can be written in Englishbut the reader only knows Spanish, so the voiceover narration language212 can be Spanish).

The animated artwork 204 and the text 206 can be paced. That is, theanimated artwork 204 and the text 206 can be deliberately set to mimic apace at which the book 202 would be read. Some variation in pacing canoccur, depending on various factors such as 1) an age group the book 202is meant for (e.g., variations between books intended for 2-4 year oldsversus books intended for 6-8 year olds, etc.), 2) comprehensionstandards as a result of content or a storyline of the book 202, and/or3) thematic elements within the book 202.

The music 208 can be synchronized to the artwork 204 to further enhancean experience of the reader viewing the enhanced video book 220. Themusic 208 created or added can be aligned with pacing, theme andtonality of the original book 202 and whatever additional features, suchas animations, are added to the EVB 220. The music 208 can align to thebook 202's age demographic, thematic elements within the book 202,and/or tonal elements. The music 208 can audibly represent the storylineor visuals of the original book 202. Adding the music 208 to one or moreportions of the EVB 220 can make the reading experience more engagingand maintain the reader's interest and focus without being distracting.

One or more sound effects 210 can be synchronized to different elementsof the EVB 220 to further enhance the experience of reading or viewingthe EVB 220. The sound effects 210 can be used in a manner to work inconcert with, and further add interest and life to, the animations 213,the artwork 204, and the text 206.

An overall timing of the enhanced video book 220 can be determined byone or more of factors mentioned above. Consideration can be given towords per minute, as relating to the voiceover narration 212, and beatsper minute, as relating to the music 208. The pacing of the voiceovernarration 212 and the music 208 can work in concert and may bedetermined by various factors, including an age demographic of the book202, comprehension level of an intended audience, and thematic elementsof the book 202.

The enhanced video book 220 can be linear video played at one or moredifferent frame rates. The frame rate can be a certain number of framesper second, played in sequential order to create a persistence of visionor motion perception by the viewer. The developer of the EVB 220 candetermine an appropriate frame rate that provides for a seamless,interactive, and well-paced display of the storyline of the EVB 220.

The animations 213 can be from the book 202's artwork 204, whichcontains imagery. Example imagery includes illustrative, photographic,digital, or graphical artwork. The artwork 204 can be animated and setin motion by the developer to emphasize the original artwork 204,enhance its visual appeal for video format, and maintain an adherence tothe look and intent of the original artwork 204. This can includesetting the artwork 204 as a whole in motion or breaking the artwork 204into parts, with motion that is selectively added. The artwork 204 canappear to come to life with the animations 213, which can make readingthe EVB 220 more attractive to the reader. The animations 213 or motioncan differ from traditional animation because the animations 213 can besubtle while still maintaining a quality of the original artwork 204.Therefore, the reader may not be distracted by too much animation andthe reader can have the full reader experience intended by the author ofthe original book 202.

The various elements 204-213 can be combined by the developer in anappropriate manner using the composition process 214. The compositionprocess 214 may be accomplished using an integrated compositionenvironment and/or using specialized software tools or applications tocreate the enhanced video book 220. Individual off-the-shelf softwaretools can be used in the composition process 214. For example, a rastergraphics editor and/or a vector graphics editor may be used by thedeveloper to generate or modify animations, graphics, and/or artwork.The outputs of these editors or software tools can then be combinedusing, for example, a digital visual effects, motion graphics, andcompositing application(s) to generate the enhanced video book 220. Insome implementations, the artwork 204 (e.g., imagery) within the book202 can be animated and set in motion to create the animations 213 usingone editing application. The music 208 can be added, paced, and/or timedto match timing derived from the book 202 using one or more otherediting applications. These components or features can then besynchronized to the video picture using additional editing applications.The voiceover narration 212, described later, can be synchronized toanimated read-along text using additional editing applications. Thesound effects 210 can also be generated and designed to be synchronizedand match the artwork 204 and other components of the EVB 220 usingediting applications. The Enhanced video book 220 file can then berendered and exported as a linear video file having a frame rate basedon certain specifications and standards dependent on a delivery method.This step can also be performed by another editing application.Therefore, the EVB 220 can be fully customized using one or more editingapplications of the developer's choosing.

The text 206 can be displayed in the enhanced video book 220 along witha corresponding audio tract, such as closed captioning. For example,closed captioning text may not be animated in a word-for-word fashion.The closed captioning text can be enhanced to appear in groupings oflines or sentences that are more interactive and/or engaging to thereader, as described further below.

Still referring to FIG. 2 , the enhanced video book 220 can be deliveredto an end-user's device screen 218 via the content delivery network 216(e.g., video content delivery network, broadcast, CDN). The end user(e.g., reader) can view the EVB 220 on the device screen 218. The devicescreen 218 can be part of a digital device with linear video playbackcapability. As discussed earlier, the enhanced video book 220 can bestreamed (e.g., from a cloud or other remote database), broadcasted,and/or downloaded to the end-user's device.

FIGS. 3A-3E illustrate an example of a protocol for animated read-alongtext for the enhanced video book of FIGS. 1A-1C. The example protocolfor animated read-along text (PART) can be applied to each word insuccession. PART is a digital word for word reproduction of verbatimtext of published storybooks, picture books, or other books, wherein thereproduced words can be delivered as a video format. PART can beaccompanied by the artwork 204 or animations 213 (still or moving),voiceover narration 212, the sound effects 210, and/or the music 208, asdescribed in reference to FIG. 2 . Although some of these elements maynot be suitable for, or intended for use in traditional motion pictures,film, short film, videos, audio books, or e-books, they can be used toprovide more enhanced and visual books to readers (e.g., the enhancedvideo book 220 in FIG. 2 ). FIGS. 3A-3E depict five different visualstates in connection with words as they are being spoken. One or moreother emphasis states can be created and applied to words as they arebeing spoken.

As depicted, first, in State 1 (FIG. 3A), a line of text (“a giraffegone quackers”) appears. The first word in the text string (“a”) hasalready been spoken so the first word appears in State 5 (e.g.,unbolded, opaque font). The remainder of the text in FIG. 3A (“giraffegone quackers”) has not yet been spoked so it is displayed in State 1format, namely, translucent (e.g., lower than 100% opaque). Therefore,the text can be partially see-through but still readable and legible.This more translucent text matches and represents a visual version ofwords that are not yet spoken by voiceover, or heard via audio, butforthcoming in an EVB, video, e-book, or other digital linear format.Where there is no voiceover or audio narration with the text, thetranslucent text can appear sequentially at a pace at which the readeris expected to read the line of text.

Next, in State 2 (FIG. 3B), the word being spoken (“giraffe”) isdisplayed as outlined with a translucent stroke (e.g., the outline has adifferent opacity than that of State 1) encompassing the word as it isbeing spoken. This sort of animation can assist the reader in followingalong at the pace at which the word is being read or spoken. Next, inState 3 (FIG. 3C), the outline encompassing the word being spoken(“giraffe”) becomes slightly darker (e.g., more opaque) than in State 2.This state can occur while the word is being spoken to help emphasizethe word and/or syllables therein. Next, in State 4 (FIG. 3D), theoutline encompassing the word being spoken (“giraffe”) resolves to afully opaque, slightly bolded word. This state can indicate that theword has been read or spoken. Lastly, in State 5, the word being spoken(“giraffe”) resolves completely, e.g., into fully opaque, unbolded text.This state can indicate that the word has already been spoken and thewords following “a giraffe” are about to be read or spoken. Thus, theemphasis described in relation to the words “a giraffe” can be repeatedfor each subsequent word in the line of text. Although not shown, theprocess of transitioning from State 1 through State 5 can repeat foreach word in the line of text. Once all the words in the line of textare spoken, all the words can appear in the State 5 emphasis.

As mentioned, a narrator's voice can speak in a timed format and PARTcan be used to maintain accuracy and consistency of the voice with theanimated text. When the narrator's voice is used, the voice may read theword that is being altered through animation via PART, eitherimmediately before, simultaneous, or after. Regardless, the pace atwhich the voice reads the words can remain consistent for an entireline, sentence, paragraph, page, video, or overall book. This can assistthe reader in learning how to read at a steady pace.

The narrated words transition from semi-transparent text (State 1), thenfully opaque (State 2), then back to a less opaque version (State 3),settling back to normal (State 4) or semi-transparent (State 5) as thewords are spoken. This word synchronization with narration or voiceovercan be emphasized at a pace meant to match a range of beats per minute,as defined by the original author, publisher, or developer. Additionalfactors can be used to determine an appropriate pace to read the wordsand simultaneously emphasize the words. Those factors can include astoryline of the book, purpose or theme of the book, intended audienceof the book, and/or purpose of reading the book (e.g., learning a newlanguage). The animated text pace can be adjusted based on various otherfactors. For example, the animated text pace can be adjusted based onvisual recognition, fixed narration speed, adjustable narration speed, aread-back function adjustment, or a reading level or skill of thereader.

Although FIGS. 3A-3E show an example of text animation involving fivesuccessive states, any quantity of states may be used and decided by thedeveloper to suitably emphasize or otherwise draw attention to each wordas it is being spoken. Such customization in word emphasis can beadvantageous to assist a reader in learning how to read, how topronounce words, how to pace, and/or how to read faster. In addition,the change states can be indicated in any manner that conveysdynamically which word is being spoken. For example, different colors,fonts, underlining, shading, cross-hatching, or the like can be used forthe various States 1-5 or any other emphasis states. Alternativelyand/or additionally, individual letters within a single word can bevisually emphasized (e.g., transition through States 1-5) and/or anentire string of words can be visually emphasized simultaneously as thestring is being spoken. Emphasis of words in the line of text can alsobe advantageous to assist the reader in maintaining focus and interestin reading the enhanced video book.

FIG. 9 illustrates another example of a protocol for animated read-alongtext for the enhanced video book of FIGS. 1A-1C. States 1-5 in FIG. 9can be similar and/or different to the States 1-5 as depicted anddescribed in reference to FIGS. 3A-3C. As shown in FIG. 9 , first, instate 1, a line of text (“Example read-along text animation”) appears.The first word in the text string (“Example”) has not been spoken so itis displayed as outlined with a translucent stroke, which differs fromstate 5 (e.g., unbolded, opaque font). The remainder of the text instate 1 (“read-along text animation”) also has not yet been spoken so itis displayed in state 1 format (e.g., translucent, or less than 100%opaque). Therefore, the text can be partially see-through, yet still bereadable and legible, as previously described.

Next, in state 2, the word being spoken (“Example”) is displayed asoutlined with a translucent stroke (e.g., the outline has a differentopacity than that of state 1) encompassing the word as it is beingspoken. Next, in state 3, the outline encompassing the word being spoken(“Example”) becomes slightly darker (e.g., more opaque) than in state 2.Next, in state 4, the outline encompassing the word being spoken(“Example”) resolves to a fully opaque, slightly bolded word. Lastly, instate 5, the word being spoken (“Example”) resolves completely (e.g.,into fully opaque, unbolded text). Although not shown, the process oftransitioning from state 1 through state 5 can repeat for each word asit is spoken, until all of the text in the string has been spoken orread.

In the manner described throughout this disclosure, as voiceover oraudio is heard, matching translucent text can animate (e.g., word byword, letter by letter) to become fully opaque. The animation to 100%opaqueness can be synchronized to match timing of the audible words, sothat as a word is heard or represented audibly, the same wordtransitions from translucent to opaque. This animation can beadvantageous to assist the reader in reading and learning. Synchronizedanimation of the text's opacity can be accompanied by additional oralternate animation to add emphasis to each word or letter as it isaudibly heard. Examples of additional or alternate animations to addemphasis may include (i) bounds of the text expanding outwardmomentarily before contracting back again to its original size, (ii)weight of the text changing momentarily before returning to its originalweight, (iii) color of the text changing, (iv) an outline or strokebeing added to the text, and/or (v) any combination thereof.

In each of the above examples, extra animation applied for emphasis canbe applied word by word, or letter by letter, in a synchronized fashion,so as to emphasize the specific word or letters being audibly heard at agiven moment. For instances where no words are heard or presentedaudibly, text animation (e.g., translucency and additional or alternateanimation for emphasis) can occur in a sequential order, so as tovisually mimic the text as it can be spoken or read. This can assist thereader in establishing a steady pace to read the text.

In reference to FIGS. 3A-3C and FIG. 9 , rather than one word beingdisplayed and spoken at a time before the next word is displayed andspoken, an entire sentence (or paragraph or page) of text can bepresented to the reader. In such an example, a visual appearance of eachwords forming the sentence can be altered as that word is spoken. Thiscan provide the reader with an engaging “read-along” experience. Thisprotocol (e.g., PART) can be applied not only to enhanced video books,but also to video segments, e-books, or any other digital linear formatwhere text appears on the screen and is timed in connection with, and/orsynchronized to, voiceover or some other audio. Alternatively, when novoiceover or audio is present, or if the sound is muted or otherwise notpresent, text can appear emphasized on the screen at a pace or timingthat is intended to simulate a spoken progression of the text.

Features used in media may include but are not limited to underline,highlight, full bold, etc. PART incorporates more features to text thatcan benefit the reader in reading, learning how to read, focusing on astoryline, and finding their position in the text. Protocol for animatedread-along text can display text along with a corresponding audio track,which is different than traditional closed captioning. Traditionalclosed captioning text may not be animated in a word for word or letterby letter format. For example, closed captioning, which is the standardfor video formats, can provide entire lines, sentences, or paragraphs onthe screen without additional animation or emphasis. As a result, thereader can have trouble knowing their position in the text or followingalong as the text is being read. Animating text using PART, as describedherein, can improve different forms of media content that includesubtitles or digitally written words, whether the media form iscomprised of motion pictures or free of such.

Read-back functionality with animated read-along text can further assistthe reader in improving their learning experience. Read-backfunctionality can use a microphone of the reader's device to listen tothe reader as they read aloud text that is being visually emphasized.Reading aloud without assistance can also be used to test the reader'sreading accuracy and speed amongst other readers. Again, young readersfind difficulty following small font and/or dense text. Animated textcan assist such young readers to follow the text outside of merelyentertainment purposes. PART can therefore be used to improveeducational and entertainment purposes of the enhanced video book. Whenimplemented, the animated text can capture the reader's attention andassist them in maintaining and/or finding their position in the text.

Read-back functionality can be provided at different paces, as describedthroughout this disclosure. For example, the pace can be based on anarration, speed at which the reader is expected to read the book, aspeed that the reader selects, and/or at a rate that a camera (e.g.,front-facing) on the reader's device senses the reader's eyes are movingacross the page. The rate of eye movement can be based on eye placementon a page/screen and text position.

The animated text can appear in conjunction with an adjustable readingspeed that can be set by the reader or another user (e.g., a parent orteacher of the reader). Displaying animated text and narrating at thesame time with the ability for one to adjust the speed adds manybenefits, as described throughout this disclosure. One may not likelylisten to an audio book at 3× speed if their brain cannot decipher allthe words that are spoken and maintain an understanding of story.Therefore, the reader can select a different speed, such as 1.5×.Setting speed for animated text allows the reader to speed up or slowdown animated text and narration, thereby making the read more enjoyableand engaging for the specific reader. A progression in chosen speed overtime can also indicate that the reader is developing their learning andreading skills.

FIG. 4 illustrates an example system and process for generating,delivering, and displaying enhanced closed captioning (ECC). As shown,the ECC system includes three components: a Build/Programming GUI 400, adelivery system 402, and a playback platform 404. The Build/ProgrammingGUI 400 can provide an environment for a content creator (e.g.,developer as described throughout this disclosure) to generatecustomized ECC text having a desired appearance (e.g., font, color,effects) and placed at a desired location in the media. The media can bean enhanced video book, as described herein, or other media formats,such as videos. In the example shown, the content creator has specifiedthat the ECC text “Designed Text” is to appear at location 406 within ascreen space 408. In addition, the content creator can specify a texttimecode 410 for each item of text, thereby time-synchronizing the ECCtext with accompanying video content.

Output of the Build/Programming GUI 400 can be an ECC delivery packet402, which includes video 414, design packet 412, video timecode 415,ECC commands 416, and packaged audio 418. The ECC commands can alsoinclude the text timecode 410, which the content creator determined inthe Build/Programming GUI 400.

The ECC delivery packet 402 can be parsed via a playback platform 404for display on a screen of the playback platform 404. The playbackplatform 404 can be a reader's device, such as a mobile phone,smartphone, tablet, computer, laptop, TV, e-reader, projector, augmentedreality device, or any other type of device having linear video playbackfunctionality. The design packet 412 can be transcoded with the video414 and/or stored on a server (e.g., cloud or other remote database) andretrieved during transmission of the delivery packet 402 to the playbackplatform 404. The design packet 412 can then be parsed via the playbackplatform 404 for streaming. The design packet 412 can include font,location, and/or animation of the text 406. These features in the designpacket 412 can be based on the ECC commands 416, which are synchronizedto the video timecode 415 and the text timecode 410. Moreover, in someimplementations, the delivery packet 402 can pull or retrieve from acloud server or other database content such as the video 414 that iswrapped in the delivery packet 402.

Conventional closed captioning is a process of displaying text on atelevision, video screen, or other visual display to enable hearingimpaired viewers to understand what words are being spoken (and/or whatsounds are being made) in a displayed scene. Such conventional closedcaptioning techniques typically are limited to a minimal font set and anautomatic, fixed placement of text on the screen (e.g., on the bottomthird of the screen). The content creator may not have control to changethe font set or text placement. As described herein, ECC allows for thecontent creator to customize text placement and font selection, which inturn allows for creative, template, and custom-designed layouts formulti-language closed caption playback of content. ECC can therefore beadvantageous to improve a viewer's experience in reading or viewing thetext during video playback.

Once the content creator makes such design choices and coded them into apredefined format, a resulting package (e.g., the delivery packet 402)can be distributed via closed caption protocols using custom tags andcalls. The ECC can then displayed on the playback platform 404's screenalong with corresponding video content (e.g., the video 414).De-coupling customized ECC from the video content 414 in this manner canprovide the content creator with great flexibility in determining whereon the screen the captioned text 406 should be displayed and what itshould look like (e.g., font selection). When working with ECC,placement of the text 406 can be made based on a per-shot or framebasis. Therefore, the text 406 may not be limited to predeterminedlocations on the screen, such as a lower third portion of the screen.ECC is fully customizable and defined by the creator of the contentrather than a close captioning system. There is no limit to placement ofthe text 406 on the screen since the content creator can fully customizeand design placement tags that map the ECC to any screen and videoresolution. The placement tags can be synced via the video timecode 414and/or the text timecode 410 through a custom dashboard by the contentcreator. As a result, the content creator can generate a more customizeddisplay of text with video content. ECC as described herein can be usedin conjunction with one or more other systems and methods as describedherein, such as PART and the enhanced video book.

ECC also provides for multi-language support with user-selectablelanguage playback based on the content creator's designed layouts in anoriginal language. ECC also provides for detailed timecode word trackingor synchronization, which allows for per-syllable and/or per-wordanimation based on timecode of the video and/or the text.

FIG. 5 illustrates example system components for performing one or moreof the processes described herein. As depicted, a computer system 502, aplayback device 504, and a content delivery system 550 can be incommunication (e.g., wired and/or wireless) via network(s) 500. Thecomputer system 502 can include input device(s) 508, output device(s)510, processor(s) 512, editing tools 514, and a communication interface528. The input device(s) 508 can be a touchscreen, keyboard, mouse,microphone, or any other similar device that is configured for receivinguser input. The output device(s) 510 can be a display screen. In someimplementations, the input device(s) 508 and/or the output device(s) 510can be part of a computing device separate from the computer system 508.That computing device can be in communication with the computer system502 via the network(s) 500. The example computing device can be atablet, laptop, computer, smartphone, or other mobile device.

As depicted in FIG. 5 , the computer system 502 also includes theprocessor(s) 512, which can be configured to perform one or moreoperations as described throughout this disclosure.

The editing tools 514 can include an artwork processor 516, an animationengine 518, a timing and pacing engine 520, an audio engine 522, asynchronization engine 524, and a rendering engine 526. One or moreother or additional editing tools can be included and/or omitted. One ormore of the editing tools 514 can also be off-the-shelf software toolsor applications as described herein. In some implementations, one ormore of the editing tools 514 can be stored in a cloud or other databaseand accessed by the computer system 502 via the network(s) 500.

The one or more editing tools 514 can be displayed at the outputdevice(s) 510 of the computer system 502. The computer system 502 canthen receive user input from the input device(s) 508 that is associatedwith the displayed editing tools 514. For example, the user can bedeveloping an enhanced video book. On the output device 510 (e.g., adisplay screen), the user can select the artwork processor 516. Theartwork processor 516 can be an existing software tool or off-the-shelfapplication. The artwork processor 516 can be displayed on the outputdevice 510 and the user can then provide input to the artwork processor516 via the input device 508. In some implementations, the artworkprocessor 516 can be configured to receive scanned images of pages of aphysical book such that the physical book can be converted into anelectronic book format. Thus, the user can scan the pages of thephysical book using a scanner (e.g., the input device 508), which isthen received and processed by the artwork processor 516. The artworkprocessor 516 can convert the scanned pages of the physical book intoeditable artwork and text.

The user can also provide input to the animation engine 518. Theanimation engine 518 can be an existing software tool or off-the-shelfapplication. The animation engine 518 can be configured to receive userinput indicating placement, appearance, motion, and/or animation of oneor more of the converted artwork and the converted text, as describedthroughout this disclosure. The animation engine 518 can then animatethe converted artwork and/or the converted text based on the user input.

The user can also provide input to the timing and pacing engine 520. Thetiming and pacing engine 520 can be an existing software tool oroff-the-shelf application. The engine 520 can be configured to receiveuser input indicating a pace at which the converted text can be read, atiming at which animations of the converted text or the convertedartwork can occur, and other timing and pacing features as describedthroughout this disclosure. The timing and pacing engine 520 canaccordingly pace or time one or more features of the enhanced videobook.

User input can also be provided to the audio engine 522. The audioengine 522 can be an existing software tool or off-the-shelfapplication. The engine 522 can be configured to receive user inputindicating music or sound effects to add to the converted artwork andtext. The user input can also include voice-over narrations of theconverted text. In some implementations, the user can generate or createthe music, sound effects, and/or voice-over narrations. In otherimplementations, the user can select the music and/or sound effects froma library of audio clips or files. The library of audio clips or filescan be provided by the audio engine 522 and/or stored in a cloud orother database and accessible through the network(s) 500. The engine 522can then add the selected voice-over narrations, music, and/or soundeffects to user designated portions of the enhanced video book.

User input can also be provided to the synchronization engine 524. Thesynchronization engine 524 can be an existing software tool oroff-the-shelf application. The engine 524 can be configured tosynchronize or align the converted artwork, the converted text,voice-over narrations, music, and/or sound effects, and any animations,as described throughout this disclosure. For example, thesynchronization engine 524 can match up voice-over narrations withanimated text to provide for read-along capabilities. As describedthroughout this disclosure, the synchronization engine 524 can alsosynchronize a video, video timecode, audio package, and enhanced closedcaption commands into a delivery packet. The delivery packet can then betransmitted to the playback device 504 upon playback request from thedevice 504. In some implementations, the engine 524 can automaticallysynchronize these features of the enhanced video book as they aregenerated by one or more of the editing tools 514. For example, when theanimation engine 518 animates text, the synchronization engine 524 canautomatically synchronize the animated text with any other features ofthe enhanced video book, such as voice-over narrations.

The editing tools 514 can also include the rendering engine 526. Therendering engine 526 can be an existing software tool or off-the-shelfapplication. The engine 526 can be configured to render the enhancedvideo book for playback on the playback device 504. Rendering theenhanced video book by the computer system 502 (e.g., server side)rather than the playback device 504 can be advantageous to ensure theenhanced video book can be quickly streamed or broadcasted at theplayback device 504. In other words, the enhanced video book may notbuffer upon delivery and payback at the playback device 504. Therefore,bigger enhanced video book files can be delivered and played at theplayback device 504. In addition, the playback device 504 can havefaster bandwidth for streaming any size enhanced video book when theenhanced video book is rendered at the computer system 502.

The communication interface 528 can provide for communication betweenany one or more of the components of the computer system 502 with anyother components (e.g., the playback device 504, the content deliverysystem 550) via the network(s) 500.

The computer system 502 can also be in communication with an enhancedvideo book (“EVB”) database 506. The database 506 can be a cloud orother form of storage that is accessible via the network(s) 500. Thedatabase 506 can store enhanced video books 530A-N that are generated bythe computer system 502, delivery methods 532A-N that are used todeliver the enhanced video books 530A-N for playback at playback deviceshaving different playback or delivery requirements, and delivery packets534A-N. The delivery packets 534A-N, as described herein (e.g., refer toFIG. 4 ), can include a design packet, as described throughout thisdisclosure, video, video timecode, enhanced closed captioning commands,and/or packaged audio. The delivery packets 534A-N can be rendered afirst time that an associated enhanced video book is requested forplayback by a playback device. As a result, the enhanced video book canbe quickly streamed or broadcasted to the playback device with minimalor no buffering time. Storing the delivery packets 534A-N in the EVBdatabase 506 can be advantageous so that rendering time can be reduced.Moreover, this is advantageous so that an enhanced video book need onlybe rendered at a first playback request rather than with every playbackrequest.

As an example, the playback device 504 can request the enhanced videobook to be played with Spanish subtitles. The computer system 502 canreceive this request and render the book with Spanish subtitles in therendering engine 526. The rendered enhanced video book file can then becommunicated over the network(s) 500, through the content deliverysystem 550, and to the playback device 504. The playback device 504 canthen play the enhanced video book with Spanish subtitles. Once thecomputer system 502 renders the enhanced video book with Spanishsubtitles, the computer system 502 can store it in the EVB database 506.Therefore, whenever any subsequent playback devices request the enhancedvideo book with Spanish subtitles, the computer system 502 can quicklyand easily retrieve the already rendered enhanced video book withSpanish subtitles from the EVB database 506 and provide that to theplayback device. This can provide for faster streaming and/orbroadcasting, reduced and/or non-existent buffering, and reduced timerendering the enhanced video book. In other words, the enhanced videobook does not have to be rendered every time that it is requested forplayback at a playback device.

Still referring to FIG. 5 , the computer system 502 can communicate withthe content delivery system 550 when delivering an enhanced video bookto the playback device 504. The content delivery system 550, asdescribed herein (e.g., refer to the content delivery network 216 inFIG. 2 ) can provide the enhanced video book (e.g., via the deliverypacket) to the playback device 504. In some implementations, the contentdelivery system 502 can render the enhanced video book for playback. Inother implementations, the content delivery system 502 can retrieve analready rendered enhanced video book from the EVB database 506 anddelivery that rendered file to the playback device 504. In yet otherimplementations, the content delivery system 550 can be part of thecomputer system 502. For example, the rendering engine 526 and thecontent delivery system 550 can be one in the same and/or subcomponentsof each other.

Still referring to FIG. 5 , the playback device 504 can include inputdevice(s) 538, output device(s) 540, processor(s) 544, and communicationinterface 546. The input device(s) 538 can be a touchscreen, keyboard,mouse, microphone, or any other similar device that is configured forreceiving user input. The output device(s) 540 can include display(s)542. The display(s) 542 can be a screen. The playback device 504 can bea tablet, laptop, computer, smartphone, e-reader, TV, augmented realitydevice, projector, or other device having video playback capabilities.The processor(s) 544 can be configured to perform one or more operationsas described throughout this disclosure (e.g., sending a playbackrequest to the computer system 502). The communication interface 546 canprovide for communication between any components of the playback device504 with any other components (e.g., the computer system 502, thecontent delivery system 550) via the network(s) 500.

The display 542 can provide a user of the playback device 504 with agraphical user interface (GUI). The GUI can include prompts requestinginput from the user. Example user input can include selection of anavailable enhanced video book for playback, selection of a pace at whichto read or play an enhanced video book, selection of a subtitle languagefor an enhanced video book, pausing an enhanced video book duringplayback, and/or stopping an enhanced video book during playback. Usingthe received user input, the processor(s) 544 can send playback requeststo the computer system 502.

FIGS. 6A-6B is a flowchart of an example process 600 for creating anenhanced video book. Any of the steps in the process 600 can beperformed by a developer (e.g., content creator) using one or more knowntechniques, editing applications (e.g., refer to FIG. 5 ), and/orsoftware applications or tools.

Referring to FIGS. 6A-6B, at 602, artwork and text can be extracted froma physical book. Extracting at least one of artwork and text from aphysical book can include scanning pages of the physical book into auser device. The extracted artwork and text can also be broken up intopieces or layers such that each piece or layer can be more easilyanimated or set in motion. As described throughout this disclosure, insome implementations, the physical book may not be received at the userdevice. Instead, a storyline (e.g., text) can be provided to the userdevice, where the storyline does not include artwork or originate from aphysical book. Therefore, the process 600 can include generating artworkand/or additional text for the received storyline. In otherimplementations where the physical book is not received at the userdevice, video or animated images can be received at the user device.Therefore, the process 600 can include generating text and/or astoryline for the received video or animated images.

At 604, the artwork and text can be converted into a format that can beanimated or set into motion. As mentioned above, converting theextracted art and text can also include breaking up the extractedartwork and text into one or more layers. The one or more layers caneach be animated or set into motion. Optionally, the developer can alsodetermine one or more pauses in at least one portion of the convertedartwork and text. The developer can also generate one or more promptsthat correspond to the one or more pauses and a storyline of theenhanced video book. As a result, when the enhanced video book is playedback at a device, a viewer can pause the book at one of the designatedpauses and review one or more prompts that correspond to the pause. Thisfeature can improve the viewer's learning and reading experiences. Thisfeature can also provide enough interactive elements in the enhancedvideo book that keep the viewer's attention and make the readingexperience captivating without distracting the viewer from completingthe enhanced video book.

At 606, a timing at which the converted artwork can be displayed can beestablished. Establishing the timing can be based on at least one of alength of the physical book, a quantity of extract artwork, and/or aquantity of extracted text.

At 608, a pace at which the converted text can be read can beestablished. The pace corresponds to timing of one or more features inthe enhanced video book, such as animation of the converted artwork.Establishing the pace can be based on at least one of an age group ofreaders of the physical book, a reader skill level, and a quantity ofextracted text. As described throughout this disclosure, the pace canalso be adjusted by a reader and/or change over time as the reader readsmore of the enhanced video book.

At 610, at least one portion of the converted artwork can be animated orset into motion. At least one animated or set into motion portion of theconverted artwork can be a character or an object (e.g., refer to FIGS.1A-1C). The developer can animate different layers of the convertedartwork and/or text using any of the techniques described herein (e.g.,ECC and/or PART). In so doing, the developer can create a moreintegrative and interactive enhanced video book without having too manydistracting elements.

At 612, at least one portion of the converted text can be animated orset into motion. One or more techniques described herein, such as inreference to ECC and/or PART, can be employed by the developer. As aresult, animated text can provide for a more interactive and engagingreading and learning experience for the reader.

At 614, voiceover narration can be generated. The voiceover narrationcorresponds to the converted text. The developer can use knowntechniques to generate the voiceover narration. Moreover, the voiceovernarration can be generated for one or more different languages. Thereader can then request the enhanced video book to be played in one ormore of the different languages. Therefore, the enhanced video book canbe read by readers having different language preferences and/or learningor reading goals or capabilities.

At 616, display of the at least one animated or set into motion portionof the converted artwork can be adjusted based on a time at which theconverted artwork can be displayed. The animated portions of theconverted artwork can also be adjusted to be aligned with a pace of thevoiceover narration or a pace at which the text would normally be read.This step can be performed to ensure that the animated artwork is notdisjointed or misaligned with one or more other components of theenhanced video book.

At 618, display of the at least one animated portion of the convertedtext can be adjusted based on the pace at which the converted text canbe read. For example, the animated text can be synchronized to displayon a screen as the text is narrated or read via the voiceover. This stepcan be performed to ensure that the reader can follow along with thetext as it is being read. As a result, the reader can improve theirreading comprehension and learning experience.

At 620, at least one animated portion of the converted text can besynchronized the with the voice-over narration. This step can optionallybe performed as part of 618. This step can also include using PART, asdescribed throughout this disclosure, to provide emphasis to words orletters as they are read. Performing this step is advantageous to ensurethat the reader can read along with the text, thereby improving thereader's reading and learning skills.

At 622, music can optionally be added to the converted artwork. Addingmusic can assist the reader in being engaged and maintaining suchinterest in a storyline of the enhanced video book. This audio can alsoassist the reader in understanding or conceptualizing differentvocabulary in the enhanced video book. The audio can be generated by thedeveloper. The audio can also be premade and retrieved from online,cloud-based, or other database services and added to the enhanced videobook.

At 624, the music can be synchronized with at least one of the at leastone animated or set into motion portion of the converted artwork, the atleast one animated portion of the converted text, and the voice-overnarration. Performing this step can provide for a more seamlessintegration of components of the enhanced video book, which can providefor a more enjoyable and captivating reading and learning experience.

At 626, sound effects can also be added. As described above withreference to the music, the sound effects can make the enhanced videobook more engaging to the reader. The sound effects can also assist thereader in understanding or conceptualizing vocabulary and/or thestoryline. The sound effects can be generated by the developer. Thesound effects can also be pre-made and retrieved from online,cloud-based, or other database services and added to the enhanced videobook.

At 628, the sound effects can be synchronized with the animated art workand the animated text, as described in reference to synchronizing themusic in 624.

At 630, the converted artwork, the converted text, the at least oneanimated or set into motion portion of the converted artwork, the atleast one animated portion of the converted text, the voice-overnarration, and the audio can be combined into an enhanced video book. Inother words, as described in reference to FIG. 4 , these components canbe combined into a design packet. The design packet can be delivered toa playback platform or device for playback.

At 632, the enhanced video book can be delivered to a user device forplayback based on a delivery method. The defined delivery method can bea streamed or broadcast delivery method. As described in reference toFIG. 4 , the enhanced video book can be encapsulated in a deliverypacket. The delivery packet can be transmitted to the user device uponreceiving a request from the user device to play the enhanced videobook. Delivery of the enhanced video book can be facilitated by acontent delivery network (e.g., refer to FIG. 2 ).

At 634, delivering the enhanced video book can include rendering theenhanced video book into a linear video file and exporting the linearvideo file based on the defined delivery method. The linear video filecan have a frame rate based on one or more specifications thatcorrespond to the defined delivery method.

Once the enhanced video book is rendered a first time, the renderedlinear video file can be stored in a database (e.g., cloud). As aresult, whenever subsequent user devices request the enhanced video bookfor playback, the rendered linear video file can be provided to thesubsequent user devices. Therefore, the enhanced video book does notneed to be rendered for every user request, which can improve streamingthe enhanced video book. In other words, the enhanced video book may notbuffer during playback at the subsequent user devices. Therefore, asdescribed throughout this disclosure, the user device can receive therendered linear video file and immediately play the file. The userdevice can have linear video playback capability and be any one of amobile phone, tablet, laptop, e-reader, TV, augmented reality device,projector, computer, or other payback device.

FIGS. 7A-7B is a flowchart of an example process 700 for animatingread-along text. The process 700 can relate to PART, as describedthroughout this disclosure (e.g., refer to FIGS. 3A-3E and FIG. 9 ).

Referring to FIGS. 7A-7B, at 702, a line of text having multiple wordscan be displayed. At 704, a pace of human speech can be identified. Thedeveloper can identify or designate a proper pace at which the line oftext should be read. As described herein, this determination can be madebased on an intended audience and/or a reading or skill level of thereader. In some implementations, each of the words can be successivelyvisually emphasized at a pace that mimics a number of syllables in eachof the words. The pace of human speech can be based on at least one ofan age group of readers, a reader skill level, and a number of words inthe multiple words. In other implementations, each of the words can besuccessively visually emphasized at a pace that is set by a reader at auser device. Each of the words can also be successively visuallyemphasized at a pace corresponding to a rate at which a user's eye movesacross a screen of a user device. The screen can display the line oftext having the words and each of the words can be successively visuallyemphasized at a pace corresponding to a speed at which a user reads theline of text.

At 706, a word from the line of text can be selected. In otherimplementations, the developer can choose to select more than one wordto be emphasized together. The developer can choose a first word thatthe developer wants to emphasize as it is being read or narrated usingvoiceover or other audio in the enhanced video book.

At 708, the selected word can be successively visually emphasized at apace of human speech. The selected word can appear in a first state(e.g., refer to FIG. 3A and FIG. 9 ). The first state can correspond toa degree of opacity that is less than 100 percent. The developer canadjust or customize the first state as well as any of the other emphasisstates described herein based on preference of the developer, an authorof a storyline of the text, and/or any other factors, as describedthroughout this disclosure.

At 710, the word can be successively visually emphasized at the pace ofhuman speech from the first state to a second state (e.g., refer to FIG.3B and FIG. 9 ). The second state can include adding an outline to theemphasized word. Displaying the emphasized word from the first state tothe second state can include incrementally increasing a degree ofopacity. In other words, as the emphasized word is being read ornarrated, the word can seamlessly transition from one state to a nextstate. This seamless transition of states can assist the reader infollowing along, knowing their position in the text, reading the text,and/or understanding pronunciation of the text. In some implementations,the outline displayed in the second state can also be a color variantthat is different from a color of the emphasized word.

At 712, the word can be successively visually emphasized from the secondstate to a third state (e.g., refer to FIG. 3C and FIG. 9 ). The thirdstate can include making the emphasized word into heavier-weighted text.Displaying the emphasized word from the second state to the third statecan also include incrementally increasing the degree of opacity.

At 714, the word can be successively visually emphasized from the thirdstate to a fourth state (e.g., refer to FIG. 3C and FIG. 9 ). The fourthstate can include depicting the emphasized word as regular text.Displaying the emphasized word from the third state to the fourth statecan also include increasing the degree of opacity to 100 percent. Inthis example process 700, four different emphasis states are described.However, one or more additional or fewer states can be used to emphasizethe words in the line of text. The developer can decide how many statesto use and what emphasis should be included for each state based on avariety of factors, as described throughout this disclosure.

Next, in 716, it can be determined whether there are more words in theline of text. If there are, then steps 706-714 can be repeated for everysubsequent word. In other words, each of the words in the line of textcan be visually emphasized one at a time. For example, a first word ofthe words can be displayed in the third state when a second word of thewords can be displayed in the second state. As another example, thefirst word of the words can be displayed from the third state to thefourth state when the second word of the words can be displayed from thesecond state to the third state and a third word of the words can bedisplayed in the second state (e.g., refer to FIGS. 3A-3E and FIG. 9 ).

If there are no more words in the line of text that can be emphasized,then it can be determined whether there are more lines of text that canbe emphasized in 718. If there are more lines of text, then steps702-716 of the process 700 can be repeated for each subsequent line oftext. For example, a second line of text having a second set of wordscan be displayed. Each of the second set of words in the second line oftext can be displayed in the first state, then successively visuallyemphasized and displayed from the first state to the second state, thethird state, the fourth state, and any additional states of thedeveloper's choosing.

If there are no more lines of text, then the process 700 can end. Inother words, the developer may have emphasized each of the lines of textfor an enhanced video book and or a portion or scene from the enhancedvideo book or other digital media format.

In some implementations, the developer can decide to emphasize one ormore letters of each of the words in a successive manner as described inthe process 700. For example, each letter of each of the words can bedisplayed in the second state, the third state, and the fourth state.Each of the letters can then be successively visually emphasized as eachof the words are dictated with voiceover narration.

FIG. 8 is a flowchart of an example process 800 for generating enhancedclosed captioning (ECC) commands. The process 800 can be used inconjunction with one or more other systems and methods described hereinto generate an enhanced video book (e.g., refer to FIG. 4 ).

At 802, a video having multiple frames can be received in a contentbuilding environment. For example, a developer can upload or import thevideo into an existing editing or software tool or application. Thevideo can include at least one of animated images and still images thatare pieced together in the frames. The video can be an enhanced videobook file, as described throughout this disclosure. The developer canupload the video that the developer would like to add enhanced closedcaptioning to.

At 804, user input defining an appearance and a location of text to bedisplayed along with the video can be received. In other words, thedeveloper can indicate where the text (e.g., ECC) should appear on ascreen relative to placement of the images in the video (e.g., refer toFIG. 4 ). The appearance and the location of text can correspond to oneor more of the frames of the video. For example, the developer canchoose to include the ECC only on some frames of the video while otherframes of the video may not have any text overlay. The user input candefine the appearance of text including at least one of a font, color,size, and emphasis of the text. The developer can also provide input tothe content building environment that includes an appearance and alocation for each word in a line of text. For example, the developer canuse PART, as described in reference to FIGS. 3A-3C, 7A-7B, and 9, tocreate read-along text.

At 806, the appearance and the location of text can be synchronized witheach of the frames of the video. The appearance and the location of textcan also be synchronized with an audio package of the video. The audiopackage can include at least one of a voiceover narration, music, andsound effects. Synchronization can be performed as described throughoutthis disclosure (e.g., refer to FIG. 2 and FIGS. 6A-6B) to provide for aseamless and interactive reading, viewing, and/or learning experience.

At 808, a design packet can be generated based on the received userinput. The design packet can include the appearance and the location oftext to be displayed along with the video. The design packet canoptionally be transcoded with the video. In some implementations, adesign packet can be generated for each enhanced video book or otherdigital media format. Therefore, the same design packet can be usedevery time that a user device requests playback of the associatedenhanced video book or other digital media format. The developer mayonly have to generate the design packet once, which increases efficiencyin generating enhanced video books or other digital media formats usingthe process 800 and any of the systems and methods described herein.

At 810, a delivery packet can be generated. The delivery packet caninclude the design packet, video, video timecode, text timecode, andenhanced closed captioning commands. The enhanced closed captioningcommands can include instructions for displaying the text along with thevideo during playback at the user device. The delivery packet can alsoinclude packaged audio. Generating the delivery packet can includerendering components of the delivery packet for playback at the userdevice. Rendering the enhanced video book or other digital media formatbefore delivery to the user device can be advantageous to improvestreaming and broadcasting and to reduce or eliminate buffering at theuser device. Therefore, the user device can more quickly and seamlesslydisplay the content for playback.

At 812, the delivery packet can be stored in a database. The database,as described herein, can be cloud-based and/or any other type of remotedata storage facility that is accessible via a network communication(e.g., wired and/or wireless). Storing the delivery packet in thedatabase is beneficial because whenever subsequent user devices requestan enhanced video book file or other digital media format that hasalready been rendered and prepared for playback, the stored deliverypacket can be retrieved and sent to the user device. Rendering is notrequired on a per-device basis. Thus, the enhanced video book or otherdigital media format can be quickly streamed or broadcasted at thesubsequent user device with minimal or no buffering.

At 814, the delivery packet can be provided to the user device forplayback. Example user devices can include a mobile phone, a laptop, atablet, an e-reader, a TV, or any other playback device. The deliverypacket can be provided to the user device upon request from the userdevice. In some implementations, the delivery packet can be parsed bythe user device. Moreover, as described above, when a second user deviceor any subsequent device requests playback of the enhanced video book orother digital media format, the associated delivery packet can beretrieved from the database and transmitted to the device for immediateplayback.

A number of implementations have been described. Nevertheless, it willbe understood that various modifications can be made without departingfrom the spirit and scope of the invention.

What is claimed is:
 1. A method for creating an enhanced video book, themethod comprising: receiving at least one of artwork and text;converting at least one of the received artwork and text into a formatthat can be animated or set into motion; establishing a timing at whichthe converted artwork can be displayed; establishing a pace at which theconverted text can be read, wherein the pace corresponds to the timing;animating or setting into motion at least one portion of the convertedartwork; animating at least one portion of the converted text;generating voice-over narration, wherein the voice-over narrationcorresponds to the converted text; adjusting display of the at least oneanimated or set into motion portion of the converted artwork based onthe timing at which the converted artwork can be displayed; adjustingdisplay of the at least one animated portion of the converted text basedon the pace at which the converted text can be read; synchronizing theat least one animated portion of the converted text with the voice-overnarration; adding audio to the converted artwork; synchronizing theaudio with at least one of the at least one animated or set into motionportion of the converted artwork, the at least one animated portion ofthe converted text, and the voice-over narration; and combining theconverted artwork, the converted text, the at least one animated or setinto motion portion of the converted artwork, the at least one animatedportion of the converted text, the voice-over narration, and the audiointo an enhanced video book.
 2. The method of claim 1, furthercomprising delivering, using a defined delivery method, the enhancedvideo book to a user device for playback.
 3. The method of claim 2,wherein delivering the enhanced video book comprises: rendering theenhanced video book into a linear video file, wherein the linear videofile has a frame rate based on one or more specifications thatcorrespond to the defined delivery method; and exporting the linearvideo file based on the defined delivery method, wherein the user devicereceives the rendered linear video file.
 4. The method of claim 2,wherein the user device (i) has linear video playback capability and(ii) is at least one of a mobile phone, a tablet, a laptop, a computer,an e-reader, a TV, a projector, and an augmented reality device.
 5. Themethod of claim 2, further comprising processing the enhanced video bookthrough a content delivery network before delivering the enhanced videobook to the user device for playback.
 6. The method of claim 2, whereinthe defined delivery method is a streaming or broadcast delivery method.7. The method of claim 1, further comprising: determining one or morepauses in at least one portion of the enhanced video book; andgenerating one or more prompts that correspond to the one or more pausesand a storyline of the enhanced video book.
 8. The method of claim 1,wherein receiving at least one of artwork and text comprises extractingthe at least one of artwork and text from a physical book by scanningpages of the physical book into a user device.
 9. The method of claim 8,wherein converting the extracted artwork and text comprises breaking upthe extracted artwork and text into one or more layers, wherein the oneor more layers can each be animated or set into motion.
 10. The methodof claim 1, wherein establishing a timing is based on at least one of alength of a storyline of the enhanced video book, a quantity ofextracted artwork, and a quantity of extracted text.
 11. The method ofclaim 1, wherein the at least one animated or set into motion portion ofthe converted artwork is a character or an object.
 12. The method ofclaim 1, wherein establishing a pace is based on at least one of an agegroup of readers of a storyline of the enhanced video book, a readerskill level, and a quantity of extracted text.
 13. The method of claim1, wherein adding audio to the converted artwork comprises creatingaudio that corresponds to at least one of a storyline of the enhancedvideo book, the converted artwork, and the converted text.
 14. Themethod of claim 1, wherein the audio is at least one of music and soundeffects.
 15. The method of claim 3, further comprising: storing therendered linear video file in a database; retrieving, from the database,the rendered linear video file based on receiving a playback requestfrom a second user device; and delivering the rendered linear video fileto the second user device for playback.
 16. A system for creating anenhanced video book, the system comprising: a first device having aplurality of editing applications, wherein the plurality of editingapplications provide a plurality of user interfaces for display at thefirst device, wherein a user provides a user input to the plurality ofuser interfaces of the plurality of editing applications, wherein thefirst device is configured to, based on the user input: receive at leastone of artwork and text; convert at least one of the received artworkand text into a format that can be animated or set into motion;establish a timing at which the converted artwork can be displayed;establish a pace at which the converted text can be read, wherein thepace corresponds to the timing; animate or set into motion at least oneportion of the converted artwork; animate at least one portion of theconverted text; generate voice-over narration, wherein the voice-overnarration corresponds to the converted text; adjust display of the atleast one animated or set into motion portion of the converted artworkbased on the timing at which the converted artwork can be displayed;adjust display of the at least one animated portion of the convertedtext based on the pace at which the converted text can be read;synchronize the at least one animated portion of the converted text withthe voice-over narration; add audio to the converted artwork;synchronize the audio with at least one of the at least one animated orset into motion portion of the converted artwork, the at least oneanimated portion of the converted text, and the voice-over narration;and combine the converted artwork, the converted text, the at least oneanimated or set into motion portion of the converted artwork, the atleast one animated portion of the converted text, the voice-overnarration, and the audio into an enhanced video book; and a seconddevice having a graphical user interface display, wherein the seconddevice is configured to: receive, from the first device, the enhancedvideo book; and play the enhanced video book on the graphical userinterface display.
 17. The system of claim 16, wherein the first deviceis further configured to: render the enhanced video book into a linearvideo file, wherein the linear video file has a frame rate based on oneor more specifications that correspond to a defined delivery method ofthe second device; and export, for playback at the second device, thelinear video file based on the defined delivery method.
 18. The systemof claim 17, wherein the first user device is further configured to:store the rendered linear video file in a database; retrieve, from thedatabase, the rendered linear video file based on receiving a playbackrequest from a third user device; and deliver the rendered linear videofile to the second user device for playback.
 19. An enhanced video bookcomprising: animated text displayed in the enhanced video book at one ormore first timecodes; animated artwork displayed in the enhanced videobook at one or more second timecodes; and voice-over narrationcorresponding to the animated text, wherein the voice-over narrationplays at the one or more first timecodes, wherein the one or more firsttimecodes and the one or more second timecodes are synchronized at apace at which the animated text can be read and the animated artwork canbe viewed.
 20. The enhanced video book of claim 19, further comprisingat least one of music or sound effects that are synchronized with theone or more first timecodes and the one or more second timecodes.